Picture for Fengzhuo Zhang

Fengzhuo Zhang

Annealed Relaxation of Speculative Decoding for Faster Autoregressive Image Generation

Add code
Jan 14, 2026
Viaarxiv icon

Demystifying the Slash Pattern in Attention: The Role of RoPE

Add code
Jan 13, 2026
Viaarxiv icon

Sparse-to-Dense: A Free Lunch for Lossless Acceleration of Video Understanding in LLMs

Add code
May 25, 2025
Viaarxiv icon

BanditSpec: Adaptive Speculative Decoding via Bandit Algorithms

Add code
May 21, 2025
Figure 1 for BanditSpec: Adaptive Speculative Decoding via Bandit Algorithms
Figure 2 for BanditSpec: Adaptive Speculative Decoding via Bandit Algorithms
Figure 3 for BanditSpec: Adaptive Speculative Decoding via Bandit Algorithms
Figure 4 for BanditSpec: Adaptive Speculative Decoding via Bandit Algorithms
Viaarxiv icon

LongSpec: Long-Context Speculative Decoding with Efficient Drafting and Verification

Add code
Feb 24, 2025
Figure 1 for LongSpec: Long-Context Speculative Decoding with Efficient Drafting and Verification
Figure 2 for LongSpec: Long-Context Speculative Decoding with Efficient Drafting and Verification
Figure 3 for LongSpec: Long-Context Speculative Decoding with Efficient Drafting and Verification
Figure 4 for LongSpec: Long-Context Speculative Decoding with Efficient Drafting and Verification
Viaarxiv icon

Enhancing Multi-Text Long Video Generation Consistency without Tuning: Time-Frequency Analysis, Prompt Alignment, and Theory

Add code
Dec 23, 2024
Figure 1 for Enhancing Multi-Text Long Video Generation Consistency without Tuning: Time-Frequency Analysis, Prompt Alignment, and Theory
Figure 2 for Enhancing Multi-Text Long Video Generation Consistency without Tuning: Time-Frequency Analysis, Prompt Alignment, and Theory
Figure 3 for Enhancing Multi-Text Long Video Generation Consistency without Tuning: Time-Frequency Analysis, Prompt Alignment, and Theory
Figure 4 for Enhancing Multi-Text Long Video Generation Consistency without Tuning: Time-Frequency Analysis, Prompt Alignment, and Theory
Viaarxiv icon

When Attention Sink Emerges in Language Models: An Empirical View

Add code
Oct 14, 2024
Figure 1 for When Attention Sink Emerges in Language Models: An Empirical View
Figure 2 for When Attention Sink Emerges in Language Models: An Empirical View
Figure 3 for When Attention Sink Emerges in Language Models: An Empirical View
Figure 4 for When Attention Sink Emerges in Language Models: An Empirical View
Viaarxiv icon

Unveiling the Statistical Foundations of Chain-of-Thought Prompting Methods

Add code
Aug 25, 2024
Figure 1 for Unveiling the Statistical Foundations of Chain-of-Thought Prompting Methods
Figure 2 for Unveiling the Statistical Foundations of Chain-of-Thought Prompting Methods
Figure 3 for Unveiling the Statistical Foundations of Chain-of-Thought Prompting Methods
Figure 4 for Unveiling the Statistical Foundations of Chain-of-Thought Prompting Methods
Viaarxiv icon

From Words to Actions: Unveiling the Theoretical Underpinnings of LLM-Driven Autonomous Systems

Add code
May 30, 2024
Viaarxiv icon

Learning Regularized Graphon Mean-Field Games with Unknown Graphons

Add code
Oct 26, 2023
Figure 1 for Learning Regularized Graphon Mean-Field Games with Unknown Graphons
Figure 2 for Learning Regularized Graphon Mean-Field Games with Unknown Graphons
Figure 3 for Learning Regularized Graphon Mean-Field Games with Unknown Graphons
Viaarxiv icon